Phylogenetic Inference using Genetic Algorithm-based Least Squares Methods
نویسندگان
چکیده
The only illustration in Darwin’s Origin of the Species shows a phylogenetic tree. This type of tree is used to represent the evolutionary relationship between different species. Phylogenetic inference is constructing such a tree based on the measured pairwise distance of all species. There exists many methods to evaluate how well a particular tree fits the measurements. One of these is called the least squares method, in which we wish to minimize the square of the error of all species between the measured distance and the distance in the tree. Because the search space of trees is huge, it is not possible to examine all possibilities; it is necessary to utilize a search heuristic. We have chosen to use genetic algorithms (GA) which handles and changes a population of trees. GAs have been used in several articles on similar problems, but we find that it is often applied superficially. For instance there is seldom spent time on choosing the GA’s operators, the initial population or how to choose the parameters of the GA. In this thesis we have therefore implemented a genetic algorithm to solve the phylogeny problem with respect to least squares, and we have examined how the initial population and in particular the parameters influences the efficiency of the GA. Our experiments have shown that a self-adaptive approach gives good results without the need for time consuming tuning of parameters, and that the GA can be quickened by basing the initial population on a faster heuristic. In addition, our GA has been able to find better results than the neighbor joining algorithm.
منابع مشابه
Robustness of phylogenetic inference based on minimum evolution.
Minimum evolution is the guiding principle of an important class of distance-based phylogeny reconstruction methods, including neighbor-joining (NJ), which is the most cited tree inference algorithm to date. The minimum evolution principle involves searching for the tree with minimum length, where the length is estimated using various least-squares criteria. Since evolutionary distances cannot ...
متن کاملEarthwork Volume Optimization Using Imperialistic Competitive Algorithm to Minimize Energy Consumption of Agricultural Land Leveling
Land leveling is one of the most important steps in soil preparation for consequent objectives. Parallel policies need to take both energy and environmental subjects into the account as well as certain financial development and eco-friendly protection. Energy is one of the most important elements in agricultural sector. Nevertheless, pollution is linked with the usage of fossil fuels (particul...
متن کاملSpoofing Mitigation of GPS Receiver using Least Mean Squares-Based Adaptive Filter
The Global Positioning System (GPS) signals are very weak signal over wireless channels, so they are vulnerable to in-band interferences. Therefore, even a low-power interference can easily spoof GPS receivers. Among the variety of GPS signal interference, spoofing is considered as the most dangerous intentional interference. The spoofing effects can mitigate with an appropriate strategy in the...
متن کاملDetection of Breast Cancer Progress Using Adaptive Nero Fuzzy Inference System and Data Mining Techniques
Prediction, diagnosis, recovery and recurrence of the breast cancer among the patients are always one of the most important challenges for explorers and scientists. Nowadays by using of the bioinformatics sciences, these challenges can be eliminated by using of the previous information of patients records. In this paper has been used adaptive nero fuzzy inference system and data mining techniqu...
متن کاملOn the consistency of the minimum evolution principle of phylogenetic inference
The goal of phylogenetic inference is the reconstruction of the evolutionary history of various biological entities (taxa) such as genes, proteins, viruses or species. Phylogenetic inference is of major importance in computational biology and has numerous applications ranging from the study of biodiversity to sequence analysis. Given a matrix of pairwise distances between taxa, the minimum evol...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006